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AN APPLICATION OF STATISTICAL METHOD 



F. A. DEWEY 
Bryn Mawr, Pennsylvania 



Statistics, from the standpoint of the social sciences, has 
remained largely a field for hope and for study. Articles appear 
from time to time explaining some of the elementary processes. 
Articles also mention that the development of statistical method 
carries with it hope for many of the less exact sciences. One 
cannot escape a feeling that some great statistical formula or 
method is eventually to be found which will do away with 
all the uncertainties or doubts that beset the path of social 
science. 

Whether or not such a formula or set of formulae is ever to 
appear must for various reasons remain doubtful. One obstacle is 
the difficulty of dealing with mathematics that involves more than 
a few unknowns of any complexity. Another is the difficulty of 
putting human phenomena and the common experiences of life into 
mathematical form. Neither difficulty, however, need stand in 
the way of the worker who is eager to see a more exact representa- 
tion of fact in the field which is covered by the social sciences, 
whether it be sociology, economics, social technology or economy, 
or any other. Not only can observed facts be summed up pre- 
cisely through the use of existing statistical methods, but the use 
of these will in itself lead to a more careful gathering of data, and 
to a much more careful presentation of the data after the facts 
are gathered. 

To illustrate, it is possible to turn to a field where the data are 
unquestioned and where the use of a statistical summary would 
have made the methods of presenting the data scientifically unim- 
peachable and precise. In November, 1914, for example, there 
appeared 1 a tabulation relating the rent which Lithuanians in Chi- 

1 American Journal of Sociology, November, 1914, p. 297. 
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cago paid for apartments and the number of rooms in the apart- 
ments. The table was as follows: 

TABLE V 

Number of Apartments tor Which Specified Monthly Rentals Are Paid and 
Number of Rooms in Apartment 
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The author of the article accompanying the table proceeds: 
"Before leaving Table V it remains to see whether there is any 
apparent relation between the rate of rent and the number of rooms 
obtained. Little connection is noticeable. For example, the cost 
of two-room apartments ranges from less than $4.00 monthly to 
between $10.00 and $11.00 " 

The author nowhere notes that 19 of the 28 two-room apart- 
ments rented for between $4 . 00 and $6 . 00 per month, nor was any 
coefficient of correlation calculated. The work of calculating the 
coefficient of correlation is a simple task and would obviate a clumsy 
and not altogether accurate statement like the one made. And 
furthermore the table, if it had appeared with an accurate mathe- 
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matical coefficient showing the relations between rooms and rent, 
could not have appeared with foreshortened classes. Of the 790 
apartments rented 12 per cent are in one of the last three classes 
(rent unknown, free, or including a store) or else in undefined 
classes ("less than $4.00" rent; "$15.00 and less than $30.00," 
"$30.00 and over," rent; "7 or more" rooms). The uncertainties 
introduced by the foreshortening account for 74 of the 130 apart- 
ments which cannot be classified. 

As the data are presented, it is only possible for the person to 
whom the original data are inaccessible to calculate the coefficient 
of correlation for that part of the table which is complete; that is, 
to obtain it for the 660 apartments which paid at least $4 . 00 rent 
and less than $15 . 00 rent. The inclusion of the five apartments in 
this part of table which have "7 or more" rooms as 7 rooms does 
not greatly affect the coefficient of correlation. As the inclusion 
decreases the coefficient, they are included. 



Rent per Month 
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SixSiy 
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The summary of the work according to the product-moment 
(Pearson's) 1 method is as shown in the table above. The product- 
moments are positive, as given, and are summed for each rent 

1 Cf . Yule, An Introduction to the Theory of Statistics. 
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row, the negative products being subtracted in each row before 
the total is set down. 



Number of Rooms 
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7 and over 
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The coefficient of correlation is then 



\(45<>- 



<«->-)(3S36-^ 



660 






or 



=0.0157, 



821.2 
1/441. 7X 3 8 3 o =+0 - 632 - 
And the probable error of the coefficient is 

o.67(i-o.63 2 ) = o.67(o.3969) = f 
I/660 25.7 

which makes the correct expression of the relation between the 
number of rooms obtained for a given rent the following: 
r= +0.632 ±0.016. 

Since the coefficient is applicable to a part only of the table, it 
is obvious that for a part of the fiats there is some connection 
between the number of rooms and the rent; location, repairs, 
position, etc., in other words, are not the predominating factors in 
rent for a part of the table. 

In another article, which appeared in January, 1913, 1 a tabu- 
lation relating rent and the number of rooms in apartments is from 

1 American Journal of Sociology, p. 522 for date given. 
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the scientific standpoint a greater offender than the one just 
quoted. Rooms are classified only as far as "6 and over," and 
rent to "$15 . 00 and over." Sixteen and eight-tenths of the rented 
apartments are indefinitely classified because of this fact. Here, 
using the available part of the table, but counting 6 rooms and 
over as 6 rooms, the coefficient of correlation by the product- 
moment method is 0.786 ===0.008 (1,082 apartments included). 

On p. 538 of the latter article another tabulation of the same 
character appears. In this a complete tabulation of rooms is given, 
though rent stops at "$i 2 . 00 and over." Only 1 1 . 8 per cent of all 
the apartments (123) are incapable of definite classification. The 
coefficient of correlation for those which can be definitely classified 
(104) is 0.463=1=0.050. 

In the text the median rental for 6-room apartments was found 
to be much higher than that for the district in which the coefficient 
of correlation was 0.786. The author states that "it is perhaps 
hardly fair to make use of the median rental for 4-room apartments, 
since we have the facts for only 20 such apartments" (p. 536). 
The relation of any apartment of a given size to its rent can be 
expressively stated by the proper coefficient of correlation, i.e., 
0.463 ===0.050. Inasmuch as the true coefficient probably lies 
somewhere between 0.413 and o. 513, comparisons with other data 
are obviously not such as would inspire confidence. 

This is a brief illustration of one way of many in which common 
statistical method can be put to good use in the field of social 
technology or economy. The service rendered is not alone on the 
side of accuracy. The use of the method reacts back into the data 
gathered, the method of gathering it, and its presentation. The 
application of the coefficient enables anyone to test the value of 
work being done long before it is necessary to make a final summary 
and draw conclusions. 

There is a further, though perhaps minor, consequence of the 
use of precise expressions. The expressions themselves gain cur- 
rency and gradually acquire meaning in a world outside, which 
might well become aware of the possibility of accurate statement. 



